BIOTECHNOLOGY 
SYSTEMS 
BRANCH 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 
Source: 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 
Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 
Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual- ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 

Or 
U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
2011 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202 



Revised 01/29/2002 



Raw Sequence Listing Error Summary 



ERROR DETECTED            SUGGESTED CORRECTION            SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1  Wrapped Nucleics / Wrapped Aminos 
   The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
   was retrieved in a word processor after creating it. Please adjust your right margin to [illegible]; this will 
   prevent "wrapping." 

2  Invalid Line Length 
   The rules require that a line not exceed 71 characters in length. This includes white spaces. 

3  Misaligned Amino Numbering 
   The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers; 
   use space characters, instead. 

4  Non-ASCII 
   The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please 
   ensure your subsequent submission is saved in ASCII text. 

5  Variable Length 
   Sequence(s) ___ contain n's or Xaa's representing more than one residue. Per Sequence Rules, 
   each n or Xaa can only represent a single residue. Please present the maximum number of each 
   residue having variable length and indicate in the <220>-<223> section that some may be missing. 

6  PatentIn 2.0 "bug" 
   A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid 
   sequence(s) ___. Normally, PatentIn would automatically generate this section from the 
   previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
   the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
   Artificial or Unknown sequences. 
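
The line-length, tab, and ASCII requirements described above can be screened before a listing is resubmitted. The following is a minimal sketch only, not the USPTO Checker program: the function name `check_listing_lines` is hypothetical, and the 71-character limit is simply the figure quoted in this report.

```python
def check_listing_lines(text: str, max_len: int = 71) -> list[str]:
    """Return a list of formatting problems found in a sequence listing.

    max_len defaults to the limit quoted in this error report (assumption:
    the limit counts every character on the line, including spaces).
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        # Invalid line length: white space counts toward the limit.
        if len(line) > max_len:
            problems.append(
                f"line {number}: {len(line)} characters (limit {max_len})"
            )
        # Misaligned numbering is often caused by tab codes.
        if "\t" in line:
            problems.append(f"line {number}: tab character found; use spaces")
        # The file must be plain ASCII text.
        if not line.isascii():
            problems.append(f"line {number}: non-ASCII character found")
    return problems
```

Running the function over the full text of a listing returns an empty list when none of these three problems is present.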



7  Skipped Sequences (OLD RULES) 
   Sequence(s) ___ missing. If intentional, please insert the following lines for each skipped sequence: 

   (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
   (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
   This sequence is intentionally skipped 

   Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 
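
The old-rules placeholder described above can be generated mechanically once the skipped SEQ ID NOs are known. This is an illustrative sketch under stated assumptions: `missing_seq_ids` and `old_rules_placeholder` are hypothetical helper names, and the placeholder text is copied from the four lines quoted in this report.

```python
def missing_seq_ids(present_ids, declared_total):
    """Return the SEQ ID NOs in 1..declared_total that are not present."""
    present = set(present_ids)
    return [n for n in range(1, declared_total + 1) if n not in present]


def old_rules_placeholder(seq_id):
    """Build the old-rules lines for one intentionally skipped sequence."""
    return "\n".join([
        f"(2) INFORMATION FOR SEQ ID NO:{seq_id}:",
        "(i) SEQUENCE CHARACTERISTICS:",
        f"(xi) SEQUENCE DESCRIPTION: SEQ ID NO:{seq_id}:",
        "This sequence is intentionally skipped",
    ])


if __name__ == "__main__":
    # Example: a listing declares 4 sequences but only 1, 2 and 4 appear.
    for seq_id in missing_seq_ids([1, 2, 4], 4):
        print(old_rules_placeholder(seq_id))
```

The declared total passed in should already include the skipped sequences, matching the instruction to adjust the "(ii) NUMBER OF SEQUENCES:" response.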



8  Skipped Sequences (NEW RULES) 
   Sequence(s) ___ missing. If intentional, please insert the following lines for each skipped sequence. 

   <210> sequence id number 
   <400> sequence id number 
   000 

9  Use of n's or Xaa's (NEW RULES) 
   Use of n's and/or Xaa's have been detected in the Sequence Listing. 
   Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 
   In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

10 Invalid <213> Response 
   Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
   scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
   is Artificial Sequence. 

11 Use of <220> 
   Sequence(s) ___ missing the <220> "Feature" and associated numeric identifiers and responses. 
   Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
   "Unknown." Please explain source of genetic material in <220> to <223> section. 
   (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

12 PatentIn 2.0 "bug" 
   Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file, 
   resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
   listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 

13 Misuse of n 
   n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 
   any value not specifically a nucleotide. 
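
The conditions under which the <220>-<223> feature section becomes mandatory can be expressed as a small check. The sketch below is illustrative only: the function names and the way a sequence record is represented (an organism string, a list of residue codes, and a set of identifiers present) are assumptions, not part of any USPTO software.

```python
def feature_section_reasons(organism, residues):
    """Return the reasons a <220>-<223> section is mandatory, if any."""
    reasons = []
    if organism.strip().lower() in {"artificial sequence", "unknown"}:
        reasons.append("organism is Artificial Sequence or Unknown")
    if any(code.lower() in {"n", "xaa"} for code in residues):
        reasons.append("sequence contains n or Xaa")
    return reasons


def missing_feature_identifiers(present, organism, residues):
    """List the feature identifiers that are required but absent.

    present is the set of numeric identifiers found for the sequence,
    for example {"<210>", "<211>", "<212>", "<213>", "<400>"}.
    """
    reasons = feature_section_reasons(organism, residues)
    if not reasons:
        return []
    required = ["<220>", "<223>"]
    # Name/Key and Location are tied to the n / Xaa condition.
    if "sequence contains n or Xaa" in reasons:
        required = ["<220>", "<221>", "<222>", "<223>"]
    return [ident for ident in required if ident not in present]
```

A response such as "synthetic" in <213> is not caught by this check; whether a response is a valid scientific name still has to be judged by a person.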



AMO^fH - Biotechnology Systems Branch - 08/21/2001 




APPLICANT: Samuel I. Achilefu 
           Raghavan Rajagopalan 
           Richard B. Dorshow 
           Joseph E. Bugaj 
ASSIGNEE: Mallinckrodt Inc. 

<120> TITLE: Hydrophilic Cyanine Dyes 
<130> DOCKET/FILE REFERENCE: MRD-66 

<150> PRIOR APPLICATION NUMBER: US 09/484,319 
<151> FILING DATE: 2000-01-18 

<160> NUMBER OF SEQUENCES: 8 

<170> SOFTWARE: FastSEQ for Windows Version 3.0 

Does Not Comply 
Corrected Diskette Needed 



<210> SEQ ID NO: 1 
<211> LENGTH: 8 
<212> TYPE: PRT 
<213> 

<221> MOD_RES 
<223> Xaa = D-Phe 
<224> = Cys with an intramolecular disulfide bond 
      between two Cys amino acids 

<400> SEQ ID NO: 1 
Xaa (Xbb) Tyr (Xcc) Lys Thr (Xbb) Thr 

<210> SEQ ID NO: 2 
<211> LENGTH: 8 
<212> TYPE: PRT 
<213> ORGANISM: synthetic 

<221> MOD_RES 
<222> 
<223> Xaa = D-Phe 
<224> = Cys with an intramolecular disulfide bond 
      between two Cys amino acids 

<400> 
Xaa (Xbb) Tyr (Xcc) Lys Thr 



<210> SEQ ID NO: 3 
<211> LENGTH: 11 
<212> TYPE: PRT 
<213> ORGANISM: synthetic 

<221> MOD_RES 
<222> (1) [remainder illegible] 

<400> SEQ ID NO: 3 
Gly Ser Gly Gln Trp Ala Val Gly His Leu Met 
1               5                   10 

<210> SEQ ID NO: 4 
<211> LENGTH: 11 
<212> TYPE: PRT 
<213> ORGANISM: synthetic 

<221> MOD_RES 
<222> (1) [remainder illegible] 

<400> SEQ ID NO: 4 
Gly Asp Gly Gln Trp Ala Val Gly His Leu Met 
1               5                   10 

<210> SEQ ID NO: 5 
<211> LENGTH: 8 
<212> TYPE: PRT 
<213> ORGANISM: synthetic 

<221> MOD_RES 
<222> (1) [remainder illegible] 

<400> SEQ ID NO: 5 
Asp Tyr Met Gly Trp Met Asp Phe 
1               5 



<210> SEQ ID NO: 6 
<211> LENGTH: 8 
<212> TYPE: PRT 
<213> ORGANISM: synthetic 

<221> MOD_RES 
<222> 

Asp Tyr (Nle) Gly Trp (Nle) Asp Phe 
1                 5 

<210> SEQ ID NO: 7 
<211> LENGTH: 8 
<212> TYPE: PRT 
<213> ORGANISM: synthetic 

<221> MOD_RES 
<222> 
[illegible] = D-Asp 

SEQ ID NO: 7 
[illegible] Tyr (Nle) Gly Trp (Nle) Asp Phe 

SEQ ID NO: 8 
LENGTH: 8 
TYPE: PRT 

<223> [illegible] = D-L 

[illegible] Pro Arg Arg Pro Tyr Ile Leu 




<110> Smith, John; Smithgene Inc. 

<120> Example of a Sequence Listing 

<130> 01-00001 

<140> PCT/EP98/00001 
<141> 1998-12-31 

<150> US 08/999,999 
<151> 1997-10-15 

<160> 4 

<170> PatentIn version 2.0 

<210> 1 
<211> 389 
<212> DNA 
<213> Paramecium sp. 

<220> 
<221> CDS 
<222> (279)...(389) 

<300> 
<301> Doe, Richard 
<302> Isolation and Characterization of a Gene Encoding a 
      Protease from Paramecium sp. 
<303> Journal of Genes 
<304> 1 
<305> 4 
<306> 1-7 
<307> 1988-06-31 
<308> 123456 
<309> 1988-06-31 

<400> 1 

ogctQLogic tec tgcgt*^ cctcttctct cLgggciici coccctgct.^ i"» l cago tc tc 

ogggogagtg tctcgaccct cccccgccLt Lgcagctica coggcoggco ggcdggcagc 

cgotgcggc*! attgccggca gcgccacagg ctcticagcc aggcttoggg cgggctccgc 

C9cggcgc9g cggcccctct cgcgctcctc ccgcgcctci ccctcgctct ccLctcgctc 



60 
120 
180 
240 

Annex VII, page 30 

Appendix 3, page 2 

cgacccgdtt *gotc«9c*g gAggaooggg cagttegc «cg get tea *t9 tec 

Met Val Ser Met Phe Ser 

1               5 

ctg tct etc Ada tgg ccc yga etc cgt tty ttt a*Ft tgc. ttg^ etc caa 

Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val Cys Leu Phe Gln 
            10                  15                  20 



tgt ccc *Aa gtc etc ccc tgt cdc tea tea ctg c«g ccg . ftAt ctt ^ )e9 

Cys Pro Lys Val Leu Pro Cys His Ser Ser Leu Gln Pro Asn Leu 

        25                  30                  35 

<212> PRT 

<213> Paramecium sp. 

<400> 2 

Met Val Ser Met Phe Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu 
1               5                   10                  15 

Phe Val Cys Leu Phe Gln Cys Pro Lys Val Leu Pro Cys His Ser Ser 
            20                  25                  30 

Leu Gln Pro Asn Leu 
        35 



<210> 3 
<211> 11 
<212> PRT 
<213> Artificial Sequence 

<220> 
<223> Designed peptide based on size and polarity to act as a 
      linker between the alpha and beta chains of Protein XYZ. 

<400> 3 
Met Val Asn Leu Glu Pro Met His Thr Glu Ile 
1               5                   10 



<210> 4 
<400> 4 
000 



(Annex VIII follows) 



The numeric identifier shall be used only in the "Sequence 
Listing." The order and presentation of the items of information in the 
"Sequence Listing" shall conform to the arrangement given below. Each 
item of information shall begin on a new line and shall begin with the 
numeric identifier enclosed in angle brackets as shown. The submission 
of those items of information designated with an "M" is mandatory. The 
submission of those items of information designated with an "O" is 
optional. Numeric identifiers <110> through <170> shall only be set 
forth at the beginning of the "Sequence Listing." The following table 
illustrates the numeric identifiers. 



Numeric     Definition            Comments and Format                   Mandatory (M) or
Identifier                                                              Optional (O)

<110>       Applicant             Preferably max. of 10 names; one      M
                                  name per line; preferable format:
                                  Surname, Other Names and/or
                                  Initials

<120>       Title of Invention                                          M

<130>       File Reference        Personal file reference               M when filed prior to
                                                                        assignment of appl. number

<140>       Current Application   Specify as: US 07/999,999 or          M, if available
            Number                PCT/US96/99999

<141>       Current Filing Date   Specify as: yyyy-mm-dd                M, if available

<150>       Prior Application     Specify as: US 07/999,999 or          M, if applicable; include
            Number                PCT/US96/99999                        priority documents under
                                                                        35 USC 119 and 120

<151>       Prior Application     Specify as: yyyy-mm-dd                M, if applicable
            Filing Date

<160>       Number of SEQ ID NOs  Count includes total number of        M
                                  SEQ ID NOs

<170>       Software              Name of software used to create       O
                                  the Sequence Listing

<210>       SEQ ID NO: #:         Response shall be an integer          M
                                  representing the SEQ ID NO shown

<211>       Length                Respond with an integer expressing    M
                                  the number of bases or amino acid
                                  residues

<212>       Type                  Whether presented sequence molecule   M
                                  is DNA, RNA, or PRT (protein). If a
                                  nucleotide sequence contains both
                                  DNA and RNA fragments, the type
                                  shall be "DNA." In addition, the
                                  combined DNA/RNA molecule shall be
                                  further described in the <220> to
                                  <223> feature section.

<213>       Organism              Scientific name, i.e. Genus/species,  M
                                  Unknown or Artificial Sequence. In
                                  addition, the "Unknown" or
                                  "Artificial Sequence" organisms
                                  shall be further described in the
                                  <220> to <223> feature section.

<220>       Feature               Leave blank after <220>. <221-223>    M, under the following
                                  provide for a description of points   conditions: if "n," "Xaa,"
                                  of biological significance in the     or a modified or unusual
                                  sequence.                             L-amino acid or modified
                                                                        base was used in a sequence;
                                                                        if ORGANISM is "Artificial
                                                                        Sequence" or "Unknown"; if
                                                                        molecule is combined DNA/RNA

<221>       Name/Key              Provide appropriate identifier for    M, under the following
                                  feature, preferably from WIPO         conditions: if "n," "Xaa,"
                                  Standard ST.25 (1998), Appendix 2,    or a modified or unusual
                                  Tables 5 and 6                        L-amino acid or modified
                                                                        base was used in a sequence

<222>       Location              Specify location within sequence;     M, under the following
                                  where appropriate state number of     conditions: if "n," "Xaa,"
                                  first and last bases/amino acids in   or a modified or unusual
                                  feature                               L-amino acid or modified
                                                                        base was used in a sequence

<223>       Other Information     Other relevant information; four      M, under the following
                                  lines maximum                         conditions: if "n," "Xaa,"
                                                                        or a modified or unusual
                                                                        L-amino acid or modified
                                                                        base was used in a sequence;
                                                                        if ORGANISM is "Artificial
                                                                        Sequence" or "Unknown"; if
                                                                        molecule is combined DNA/RNA

<300>       Publication           Leave blank after <300>               O
            Information

<301>       Authors               Preferably max. of ten named          O
                                  authors of publication; specify one
                                  name per line; preferable format:
                                  Surname, Other Names and/or
                                  Initials

<302>       Title                                                       O

<303>       Journal                                                     O

<304>       Volume                                                      O

<305>       Issue                                                       O

<306>       Pages                                                       O

<307>       Date                  Journal date on which data            O
                                  published; specify as yyyy-mm-dd,
                                  MMM-yyyy or Season-yyyy

<308>       Database Accession    Accession number assigned by          O
            Number                database including database name

<309>       Database Entry Date   Date of entry in database; specify    O
                                  as yyyy-mm-dd or MMM-yyyy

<310>       Patent Document       Document number; for patent-type      O
            Number                citations only. Specify as, for
                                  example, US 07/999,999

<311>       Patent Filing Date    Document filing date, for             O
                                  patent-type citations only; specify
                                  as yyyy-mm-dd

<312>       Publication Date      Document publication date, for        O
                                  patent-type citations only; specify
                                  as yyyy-mm-dd

<313>       Relevant Residues     FROM (position) TO (position)         O

<400>       Sequence              SEQ ID NO should follow the numeric   M
                                  identifier and should appear on the
                                  line preceding the actual sequence


5. Section 1.824 is revised to read as follows: 

1.824 Form and format for nucleotide and/or amino acid sequence 
submissions in computer readable form. 

(a) The computer readable form required by 1.821(e) shall meet the 
following specifications: 

(1) The computer readable form shall contain a single "Sequence Listing" 
as either a diskette, series of diskettes, or other permissible media 
outlined in paragraph (c) of this section. 

(2) The "Sequence Listing" in paragraph (a)(1) of this section shall be 
submitted in American Standard Code for Information Interchange (ASCII) 
text. No other formats shall be allowed. 

(3) The computer readable form may be created by any means, such as word 
processors, nucleotide/amino acid sequence editors or other custom 
computer programs; however, it shall conform to all specifications 
detailed in this section. 

(4) File compression is acceptable when using diskette media, so long as 
the compressed file is in a self-extracting format that will decompress 
on one of the systems described in paragraph (b) of this section. 

(5) Page numbering shall not appear within the computer readable form 
version of the "Sequence Listing" file. 

(6) All computer readable forms shall have a label permanently affixed 
thereto on which has been hand-printed or typed: the name of the 
applicant, the title of the invention, the date on which the data were 
recorded on the computer readable form, the operating system used, a 
reference number, and an application serial number and filing date if 
known. 

(b) Computer readable form submissions must meet these format 
requirements: 

(1) Computer: IBM PC/XT/AT, or compatibles, or Apple Macintosh; 

(2) Operating System: MS-DOS, Unix or Macintosh; 



